Statistical Applications in Genetics and Molecular Biology

Authors

  • Mark J. van der Laan
  • Merrill D. Birkner
  • Alan E. Hubbard
Abstract

Simultaneously testing a collection of null hypotheses about a data generating distribution, based on a sample of independent and identically distributed observations, is a fundamental statistical problem with many applications. In this article we propose a new resampling based multiple testing procedure that asymptotically controls, at level alpha, the probability that the proportion of false positives among the set of rejections exceeds q, where q and alpha are user supplied numbers. The procedure involves 1) specifying a conditional distribution for a guessed set of true null hypotheses, given the data, which asymptotically is degenerate at the true set of null hypotheses, and 2) specifying a generally valid null distribution for the vector of test statistics, as proposed in Pollard & van der Laan (2003) and generalized in our subsequent articles Dudoit, van der Laan, & Pollard (2004), van der Laan, Dudoit, & Pollard (2004), and van der Laan, Dudoit, & Pollard (2004b). Ingredient 1) is established by fitting the empirical Bayes two component mixture model (Efron (2001b)) to the data to obtain an upper bound for the marginal posterior probabilities of the null being true, given the data. We establish the finite sample rationale behind our proposal, and prove that this new multiple testing procedure asymptotically controls the desired tail probability for the proportion of false positives under general data generating distributions. In addition, we provide simulation studies establishing that this method is generally more powerful in finite samples than our previously proposed augmentation multiple testing procedure (van der Laan, Dudoit, & Pollard (2004b)) and competing procedures from the literature. Finally, we illustrate our methodology with a data analysis.
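The two ingredients described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `tppfp_cutoff`, its arguments, and the choice of a single common cutoff searched over the observed statistics are all assumptions made for illustration. The posterior null probabilities (`null_prob`) and the draws from the joint null distribution (`null_draws`) are taken as given inputs here; in the article they would come from the empirical Bayes mixture fit and the resampling based null distribution, respectively.

```python
import numpy as np

def tppfp_cutoff(t_obs, null_prob, null_draws, q=0.1, alpha=0.05, rng=None):
    """Illustrative sketch (hypothetical helper, not the published algorithm).

    Returns the smallest candidate cutoff c for which the estimated
    probability that V / (V + S) exceeds q is at most alpha.

    t_obs      : (m,) observed test statistics; hypothesis j is rejected if t_obs[j] > c
    null_prob  : (m,) assumed posterior probabilities that each null is true
    null_draws : (B, m) draws from an assumed joint null distribution
    """
    rng = np.random.default_rng(rng)
    B, m = null_draws.shape
    # Ingredient 1: B guessed sets of true nulls, one Bernoulli draw per hypothesis.
    guessed_null = rng.random((B, m)) < null_prob
    for c in np.sort(t_obs):  # candidate cutoffs, smallest first
        # V: guessed false positives (null-distribution statistics above c on guessed nulls).
        v = np.sum(guessed_null & (null_draws > c), axis=1)
        # S: guessed true positives (observed statistics above c on guessed non-nulls).
        s = np.sum(~guessed_null & (t_obs > c), axis=1)
        total = v + s
        # Proportion of false positives; defined as 0 when nothing is rejected.
        prop = np.divide(v, total, out=np.zeros(B), where=total > 0)
        if np.mean(prop > q) <= alpha:
            return c
    return np.inf  # no candidate cutoff met the criterion

if __name__ == "__main__":
    # Toy data (entirely made up): 20 strong signals among 200 hypotheses.
    gen = np.random.default_rng(0)
    t = np.concatenate([10.0 + np.arange(20), gen.standard_normal(180)])
    p0 = np.concatenate([np.zeros(20), np.ones(180)])
    draws = gen.standard_normal((500, 200))
    cutoff = tppfp_cutoff(t, p0, draws, q=0.1, alpha=0.05, rng=1)
    print("cutoff:", cutoff, "rejections:", int(np.sum(t > cutoff)))
```

The sketch scans candidate cutoffs from the smallest upward and stops at the first one that satisfies the tail probability criterion, which favors rejecting as many hypotheses as the estimated error rate allows. Whether this matches the search used in the article is an assumption.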

Similar resources

Strategies and Clinical Applications of Next Generation Sequencing

Abstract DNA sequencing is one of the most valuable techniques in molecular biology and is used to determine the sequence of nucleotides in a DNA fragment. High-throughput sequencing, known as Next Generation Sequencing (NGS), has revolutionized genomic research and molecular biology; the whole human genome can now be sequenced at low cost in several days. NGS technology is simi...

SLC2A4 Polymorphisms Can Be a New Molecular Biomarker for Sports Genomics

"SLC2A4 Polymorphisms Can Be a New Molecular Biomarker for Sports Genomics" is an editorial article and has no abstract.

Statistical Applications in Genetics and Molecular Biology

This note is a comment on the article “Dimension Reduction for Classification with Gene Expression Microarray Data” that appeared in Statistical Applications in Genetics and Molecular Biology (Dai et al., 2006).

Expression Analysis of PKS13, FG08079.1 and PKS10 Genes in Fusarium graminearum and Fusarium culmorum

Background: Identification and quantification of mycotoxins produced by Fusarium species are important in controlling fungal diseases. Objectives: The potential for zearalenone, butenolide and fusarin C production was investigated in five Fusarium graminearum and five F. culmorum isolates at the molecular level. Materials and Methods: Presence of PKS13, FG08079.1 and PKS10 genes, associated with produ...

Molecular Epidemiology of Breast Cancer among Iranian-Azeri Population based on P53 Research

Background: This study was done in order to enhance our understanding about molecular and epidemiological features of breast cancer among the Azeri population with special emphasis on the detection of TP53 mutations. We also analyzed the role of the P53codon72 polymorphism (rs1042522) and its role in susceptibility to breast cancer. Methods: ...

Publication date: 2003